Citations in the Digital Library of Classics: Extracting Canonical References by Using Conditional Random Fields

Authors

  • Matteo Romanello
  • Federico Boschetti
  • Gregory Crane
Abstract

Scholars of Classics cite ancient texts using abridged citations called canonical references. In the scholarly digital library, canonical references create a complex web of links between ancient and modern sources, reflecting the deeply hypertextual nature of texts in this field. This paper aims to demonstrate the suitability of Conditional Random Fields (CRF) for extracting this particular kind of reference from unstructured texts, in order to make scholarly electronic resources easier to navigate and aggregate. In particular, we developed a parser that recognizes word-level n-grams of a text as canonical references by using a CRF model trained with both positive and negative examples.
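To make the labeling setup concrete, the following is a minimal sketch of how word-level tokens might be turned into features and labels for a CRF sequence labeler. It is not the authors' parser: the feature set, the `B-REF`/`I-REF`/`O` label scheme, the function names, and the example sentences are all assumptions chosen for illustration, and no CRF is actually trained here.

```python
from typing import Dict, List


def token_features(tokens: List[str], i: int) -> Dict[str, object]:
    """Hypothetical per-token features of the kind a CRF labeler consumes."""
    tok = tokens[i]
    return {
        "lower": tok.lower(),
        "is_capitalized": tok[:1].isupper(),
        # Abbreviated author/work names such as "Hom." or "Il." end in a period.
        "ends_with_period": tok.endswith("."),
        # Book/line numbers such as "1.1" contain digits.
        "has_digit": any(c.isdigit() for c in tok),
        # Neighbouring tokens give the model local context.
        "prev_lower": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next_lower": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def sentence_to_features(tokens: List[str]) -> List[Dict[str, object]]:
    """Feature dictionaries for every token in one sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


def spans_from_labels(tokens: List[str], labels: List[str]) -> List[str]:
    """Collect the token n-grams marked as references (B-REF followed by I-REF)."""
    spans: List[str] = []
    current: List[str] = []
    for tok, lab in zip(tokens, labels):
        if lab == "B-REF":
            if current:
                spans.append(" ".join(current))
            current = [tok]
        elif lab == "I-REF" and current:
            current.append(tok)
        else:
            if current:
                spans.append(" ".join(current))
            current = []
    if current:
        spans.append(" ".join(current))
    return spans


# Illustrative positive example: a sentence containing a canonical reference.
positive_tokens = ["See", "Hom.", "Il.", "1.1", "for", "the", "proem"]
positive_labels = ["O", "B-REF", "I-REF", "I-REF", "O", "O", "O"]

# Illustrative negative example: a sentence with no reference at all.
negative_tokens = ["The", "meeting", "ended", "at", "noon"]
negative_labels = ["O"] * len(negative_tokens)

if __name__ == "__main__":
    print(sentence_to_features(positive_tokens)[1])
    print(spans_from_labels(positive_tokens, positive_labels))
    print(spans_from_labels(negative_tokens, negative_labels))
```

In a full pipeline, feature sequences like these, paired with their label sequences, would be passed to a CRF implementation for training; the negative example contributes sequences in which every token is labeled `O`.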

Similar resources

Citation analysis of graduate Dental thesis references: Before and after an intervention

Background: The introduction of the Iranian National Medical Digital Library (INLM) several years ago was a huge investment. The aim of this study was to assess the effectiveness of this intervention by examining citation patterns in graduate dental theses before and after INLM became accessible. Methods: This analytical study was conducted among all of graduate dental ...

Conditional Random Fields for Airborne Lidar Point Cloud Classification in Urban Area

Over the past decades, urban growth has been known as a worldwide phenomenon that includes widening process and expanding pattern. While the cities are changing rapidly, their quantitative analysis as well as decision making in urban planning can benefit from two-dimensional (2D) and three-dimensional (3D) digital models. The recent developments in imaging and non-imaging sensor technologies, s...

Citation analysis of the articles published in Scientific and Research Journal of Oceanography

Background and aim: Scientific journals are a valid means of communicating up-to-date information and, through citation, a link among various fields of science. The aim of this study was to investigate the citations in the articles of 28 issues of the Scientific and Research Journal of Oceanography (JOC). Material and methods: This study investigated the citation of 290 articles publish...

A citation analysis of specialty dissertations in Hormozgan University of medical sciences

Introduction: Citation analysis is a branch of bibliometrics through which the information needs of a particular library's users can be assessed; it can therefore be used as a tool in building a library collection. This study examined the cited references of specialty dissertations in order to determine their reference type, half-life, and language. Methods: Citation analysis on all the 55 ...

Annotated Bibliographical Reference Corpora in Digital Humanities

In this paper, we present new bibliographical reference corpora in digital humanities (DH) that have been developed under a research project, Robust and Language Independent Machine Learning Approaches for Automatic Annotation of Bibliographical References in DH Books supported by Google Digital Humanities Research Awards. The main target is the bibliographical references in the articles of Rev...

Publication date: 2009